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Stock price changes occur through transactions, just as diffusion in physical systems occurs 
through molecular collisions. We systematically explore this analogy Jlj and quantify the rela- 
tion between trading activity — measured by the number of transactions NAt — an d the price 
change Ga* for a given stock, over a time interval [t, t + At]. To this end, we analyze a database 
documenting every transaction for 1000 US stocks over the two-year period 1994-1995 |^ . We find 
that price movements are equivalent to a complex variant of diffusion, where the diffusion coefficient 
fluctuates drastically in time. We relate the analog of the diffusion coefficient to two microscopic 
quantities: (i) the number of transactions NAt in At, which is the analog of the number of collisions 
and (ii) the local variance w\ t of the price changes for all transactions in At, which is the analog 
of the local mean square displacement between collisions. We study the distributions of both N^t 
and WAt, and find that they display power-law tails. Further, we find that N^t displays long-range 
power-law correlations in time, whereas WAt does not. Our results are consistent with the inter- 
pretation that the pronounced tails of the distribution of Ga* [pj~~P"2[ are due to WAt, and that the 
long-range correlations previously found p3Hf7| for |Ga*| are due to NAt- 
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Consider the diffusion Jl 8| — 120[| of an ink particle in wa- 
ter. Starting out from a point, the ink particle undergoes 
a random walk due to collisions with the water molecules. 
The distance covered by the particle after a time At is 
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N\tw\ v where the local 
2 ) is the variance 



where Sxi are the distances that the particle moves in be- 
tween collisions, and N\ t denotes the number of collisions 
during the interval At. The distribution P(xAt) is Gaus 
sian with a variance (x At ) = 
mean square displacement w\ t = {{5xi 
of the individual steps Sxi in the interval [t, t + At]. 

For the classic diffusion problem considered above: (i) 
the probability distribution P(NAt) is a "narrow" Gaus- 
sian, i.e., has a standard deviation much smaller than 
the mean (NAt), (ii) the time between collisions of an 
ink particle are not strongly correlated, so N& t at any 
future time t + r depends at most weakly on N& t at 
time t — i.e., the correlation function (NAt(t)NAt(t + r)) 
has a short-range exponential decay, (iii) the distribution 
P(wAt) is also a narrow Gaussian, (iv) the correlation 
function (wAt{t)wAt(t + r)) has a short-range exponen- 
tial decay and (v) the variable e = XAt/(wAt VAaT) is 
uncorrelated and Gaussian-distributed. These conditions 
result in XAt being Gaussian distributed and weakly cor- 
related. 

An ink particle diffusing under more general condi- 
tions would result in a quite different distribution of XAt , 
such as in a bubbling hot spring, where the character- 
istics of bubbling depend on a wide range of time and 
length scales. In the following, we will present empirical 



evidence that the movement of stock prices is equiva- 
lent to a complex variant of classic diffusion, specified 
by the following conditions: (i) P(A^At) is not a Gaus- 
sian, but has a power-law tail, (ii) N&t has long-range 
power-law correlations, (iii) P(wAt) is not a Gaussian, 
but has a power-law tail, (iv) the correlation function 
(wAt(t)wAt(t + t)) is short ranged, and (v) the variable 
e = x At / {w At^/ N At) is Gaussian distributed and short- 
range correlated. Under these conditions, the statistical 
properties of XAt will depend on the exponents charac- 
terizing these power laws. 

Just as the displacement XAt of a diffusing ink particle 
is the sum of NAt individual displacements Sxi, so also 
the stock price change GAt is the sum of the price changes 
Spi of the NAt transactions in the interval [t, t + At] , 
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Figure la shows NAt f° r classic diffusion and for one 
stock (Exxon Corporation). The number of trades for 
Exxon displays several events the size of tens of standard 
deviations and hence is inconsistent with a Gaussian pro- 
cess [§|-§e|. 

(i) We first analyze the distribution of NAt- Figure [j]c 
shows that the cumulative distribution of A^At displays 
a power- law behavior P{N > x} ~ x^ 13 . For the 1000 
stocks analyzed, we obtain a mean value (3 — 3.40 ± 0.05 
(Fig. |l|d). Note that /3 > 2 is outside the Levy stable 
domain < [3 < 2. 

(ii) We next determine the correlations in Aa*. We find 
that the correlation function (NAt{t)NAt(t + t)) is not 
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exponentially decaying as in the case of classic diffusion, 
but rather displays a power-law decay (Fig. |l]e,f). This 
result quantifies the qualitative fact that if the trading 
activity (NAt) is large at any time, it is likely to remain 
so for a considerable time thereafter. 

(iii) We then compute the variance w At = ((5pi) 2 ) of 
the individual changes Spi due to the Aa* transactions in 
the interval [t, t+At] (Fig.||a). We find that the distribu- 
tion P(wAt) displays a power-law decay P{wAt > x} ~ 
x -i (Fig. 0b). For the 1000 stocks analyzed, we obtain 
a mean value of the exponent 7 = 2.9 ±0.1 (Fig. ||c). 

(iv) Next, we quantify correlations in WAt- We find 
that the correlation function (wAt(t) WAt(t + t)) shows 
only weak correlations (Fig. ||d,e). This means that WAt 
at any future time t + r depends at most weakly on WAt 
at time t. 

(v) Consider now Spi chosen only from the interval 
[t, t + At], and let us hypothesize that these 5pi are 
mutually independent and with a common distribution 
P(Spi\t G [i,i + At]) having a finite variance w At . Under 
this hypothesis, the central limit theorem implies that 
the ratio 



distribution P{NAt > x} 
Therefore, P{\/Aa7 > x} 



x-P with j3 w 3.4 (Fig.0c,d). 
- x~ 2f} with 2(3 « 6.8. Equa- 
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must be a Gaussian-distributed random variable with 
zero mean and unit variance [ p7{ . Indeed, for classic dif- 
fusion, XAt/(wAtV Na) is Gaussian-distributed and un- 
correlated (Fig. ||a). We confirm this hypothesis by an- 
alyzing (a) the distribution P(e), which we find to be 
consistent with Gaussian behavior (Fig. ||b), and (b) the 
correlation function (e(t) e(i + r)), for which we find only 
short-range correlations (Fig. ^Jc,d) . 

Thus far, we have seen that the data for stock price 
movements support the following results: (i) the dis- 
tribution of NAt decays as a power-law, (ii) NAt has 
long-range correlations, (iii) the distribution of WAt 
decays as a power-law, (iv) WAt displays only weak 
correlations, and (v) the price change Ga* at any 
time is consistent with a Gaussian-distributed random 
variable |^l| ^4],^8 31 with a time-dependent variance 
NAtW At , that is, the variable e = G At I [w At\/ N At) is 
Gaussian-distributed and uncorrelated. 

Next, we explore the implications of our empirical 
findings. Namely, we show how the statistical proper- 
ties p|-|T5|] of price changes Ga* can be understood in 
terms of the properties of N&t and WAt- We will ar- 
gue that the pronounced tails of the distribution of price 
changes fi|-|l2|] are largely due to WAt and the long-range 
correlations previously found jl3]-[l7) for |Ga*| are largely 
due to the long-range correlations in NAt- By contrast, 
in classic diffusion NAt and WAt do not change the Gaus- 
sian behavior of a; At because they have only uncorrelated 
Gaussian-fluctuations [^l],|7j ■ 

Consider first the distribution of price changes GAt, 
which decays as a power-law P{Ga* > x} ~ x~ a with 
an exponent a ~ 3 ]^-|l^]. Above, we reported that the 



tion then implies that NAt alone cannot explain the 
value a ~ 3. Instead, a w 3 must arise from the distri- 
bution of WAt 1 which indeed decays with approximately 
the same exponent 7 « a ~ 3 (Fig. |b,c). Thus the 
power-law tails in P(Ga*) appear to originate from the 
power-law tail in P(u>At)- 

Next, consider the long-range correlations found for 
|Ga*| |13-M. Above, we reported that N^t displays 
long-range correlations, whereas WAt does not (Figs. 1- 
2). Therefore, the long range correlations in |Ga*| should 
arise from those found in Aa*. Hence, while the power- 
law tails in P(Ga*) are due to the power-law tails in 
P(wAt), the long-range correlations of |GAt| are due to 
those of Aa*. 

In sum, we have shown that stock price movements 
are analogous to a complex variant of classic diffusion. 
Further, we have empirically demonstrated the relation 
between stock price changes and market activity, i.e., 
the price change at any time GAt(t) is consistent with a 
Gaussian-distributed random variable with a local vari- 
ance NAt w At . What could be the interpretations of our 
results for the number of transactions Aa* and the local 
standard deviation WAt? Since AAt measures the trading 
activity for a given stock, it is possible that its power-law 
distribution and long-range correlations may be related 
to "avalanches" [j32|-[35l| . The fluctuations in WAt reflect 
several factors: (i) the level of liquidity of the market, (ii) 
the risk-aversion of the market participants and (iii) the 
uncertainty about the fundamental value of the asset. 
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FIG. 1. Statistical properties of N& t - a) The lower panel 
shows A^At for Exxon Corporation with At = 30 min and the 
average value (NAt) ~ 52. The upper panel shows a sequence 
of uncorrelated Gaussian random numbers with the same 
mean and variance, which depicts the number of collisions 
in NAt for the classic diffusion problem. Note that in con- 
trast to diffusion, NAt for Exxon shows frequent large events 
of the magnitude of tens of standard deviations, which would 
be forbidden for Gaussian statistics, b) The histogram of the 
average time interval between trades (St) for the 1000 stocks 
studied. In order to ensure that the sampling time interval 
At for each stock contains sufficient number of transactions, 
we partition the stocks into 6 groups (I- VI) based on (St). 
For a specific group, we choose a sampling time At at least 
10 times larger than the average value of (St) for that group. 
We choose the sampling time interval At = 30, 39, 65, 78, 130 
and 190 min respectively for groups I-VI. c) Log-log plot of 
the cumulative distribution of NAt for the stocks in each of 
the six groups in b). Since each stock has a different aver- 
age value of (NAt), we use a normalized number of transac- 
tions HAt = NAt /{NAt)- Each symbol shows the cumulative 
distribution P{nAt > x} of the normalized number of trans- 
actions nAt for all stocks in each group, d) The histogram 
of exponents obtained by fits to the cumulative distributions 
P{N A t > x} for each of the 1000 stocks. For the 1000 stocks 
studied, we obtain an average value j3 = 3.40. We calculate 
an error estimate ±0.05 by dividing the standard deviation of 
the estimates of f3 by VlOOO. 
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e) In order to accurately quantify time correlations 
in N^t, we use the method of detrended fluctuations ||(| 
often used to obtain accurate estimates of power-law cor- 
relations. We plot the detrended fluctuations F(t) as a 
function of the time scale r on a log-log scale for each of 
the 6 groups. Absence of long-range correlations would 
imply F(t) ~ r 5 , whereas F(t) ~ t v with 0.5 < v < 1 
show power-law correlations with long-range persistence. 
For each group, we plot F(t) averaged over all stocks in 
that group. In order to detect genuine long-range correla- 
tions, the U-shaped intraday pattern for N^t bas been re- 
moved by dividing each N& t by the intraday pattern |L4] . 
For 0.5 < v < 1.0, correlation function exponent u c f and 
v are related through v c f = 2 — 2v. f ) The histogram of 
the exponents v obtained by fits to F(j) for each of the 
1000 stocks shows a relatively narrow spread of v around 
the mean value v = 0.85 ± 0.01. 
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FIG. 2. Statistical properties of WAt- a) The local stan- 
dard deviation WAt computed from price changes Spi due to 
every transaction in the interval [t, t + At] for Exxon Cor- 
poration (lower panel) in contrast to uncorrelated Gaussian 
random numbers with the same mean value (wAt) ~ 0.08 and 
variance (upper panel). The time series of WAt for Exxon 
shows a number of large events of the size of tens of standard 
deviations. Intervals having fewer than 4 transactions are not 
used for computing WAt- Note that the large values of NAt in 
Fig. [j^, do not correspond to large values of WAt, showing that 
NAt and wai are weakly, if at all, correlated, b) Log-log plot 
of the cumulative distribution of WAt for each of the six groups 
defined in Fig. [ljb. Since the average value (id a*) changes from 
one stock to another, we normalize WAt by (wAt)- Each sym- 
bol shows the cumulative distribution of the normalized wai 
for all stocks in each group, c) The power law exponents for 
the cumulative distribution of WAt obtained by fits to the cu- 
mulative distributions of each of the 1000 stocks separately. 
We obtain an average value 7 = 2.9 ± 0.1. d) Log-log plot of 
the detrended fluctuation F(t) as a function of the time lag 
r. Each symbol shows F(t) averaged over all stocks in each 
group, e) The histogram of detrended fluctuation exponents 
obtained by fitting F(t) for each stock separately. We obtain 
an average value fi = 0.60 ± 0.01. 
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FIG. 3. Statistical properties of e. a) The time series of 
e = G At I '(w At V w At) for Exxon Corporation (lower panel) 
in contrast with a sequence of uncorrelated Gaussian ran- 
dom numbers with the same mean and variance which de- 
picts XAt/(wAt\/NAt) for classic diffusion (upper panel), b) 
The positive tail of the cumulative distribution of e for the 
six groups. We normalize e by its standard deviation in order 
to compare different stocks. Each symbol shows the cumula- 
tive distributions of the normalized e for all stocks in each of 
the six groups. The negative tail (not shown) displays similar 
behavior, c) Log-log plot of the detrended fluctuation F(t) 
averaged for all stocks belonging to each of the six groups, d) 
The histogram of detrended fluctuation exponents obtained 
by fits to F(t) for each stock. We obtain the average value 
r) = 0.48 ± 0.06. 
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